Abstract.

Much of scientific progress now hinges on the reliability, falsifiability and
reproducibility of computer source codes. Astrophysics in particular is a discipline that
today leads other sciences in making useful scientific components freely available online,
including data, abstracts, preprints, and fully published papers, yet even today many
astrophysics source codes remain hidden from public view. We review the importance
and history of source codes in astrophysics and previous efforts to develop ways in
which information about astrophysics codes can be shared. We also discuss why some
scientist coders resist sharing or publishing their codes and the reasons for and importance
of overcoming this resistance, and we alert the community to a reworking of one of the first
attempts at sharing codes, the Astrophysics Source Code Library (ASCL). We discuss
the implementation of the ASCL in an accompanying poster paper. We suggest that
code could be given a similar level of referencing as data gets in repositories such as
ADS.



1. Introduction 

The importance of scientific codes has increased; indeed, this importance is considered
a fact of life (Weiner et al. 2009) and is continually being discussed in the literature.



Many examples of public codes now exist that have become industry standard software,
such as SExtractor, CLOUDY, and GADGET, to name a few.

In some fields (e.g., bioinformatics) journals include software used to generate
results with their articles or require that it be submitted. Gray & Mann (2011) claim the
astrophysics community is not there yet, but scientists are encouraged to release their
codes and are told that their codes are good enough to release even if messy or rough
(Barnes 2010).



Scientists may see their codes, or their research teams' codes, as proprietary and thus
refrain from publishing them.

Appropriate software is often as important to research as data are (Grosbol & Tody
2010), and Weiner et al. (2009) state that useful public software packages have
enabled easily as much science as yet another large telescope would have. Though the
NSF was specifically addressing cyber-infrastructure with the statement that its
strategic plan defines research infrastructure as including investments in experimental
tools, we believe a case can be made for scientific codes fitting within this strategic
goal (Stewart et al. 2010).



2. A Brief History of Source Codes 

In the early years of computational astrophysics, several important codes were devel- 
oped but made available only to the social communities surrounding their developers. 
These social communities would typically include close collaborators, graduate stu- 
dents, postdoctoral fellows, and the graduate students of close collaborators. 

An example of this is the "Wilson-Devinney" code that models eclipsing binary
star systems and their observable light curves. A first version of this code was written
by Robert Wilson in or before 1971. The Wilson-Devinney code has been upgraded and
adapted numerous times. Only recently has this code been made available via anonymous FTP.

Another example is the "Aarseth" code, which models gravitational N-body
interactions; it was started around 1960 and was one of the participants in the IAU
25-body problem "contest". Several versions of the Aarseth code are now made publicly
available by its primary author, Sverre Aarseth, through his web site.



3. Previous Online Efforts 

With the rise of the internet, packages such as AIPS, IRAF, and GIPSY quickly became
available to the community.

In the 1980s and 1990s, several prominent astrophysics codes were released to the 
public over specific web sites. Most of these were not associated with a specific scien- 
tific paper. Many of these sites still exist today. The primary way one found out about 
codes like these was through a mentor or collaborator. In the general field of
computing, websites sprang up providing registration or repository services (e.g.,
Freshmeat, SourceForge, GitHub, and Google Code).

Two of the earliest code collections in astronomy were AstroWeb and ASDS, but
neither is maintained anymore (though AstroWeb is still available at NRAO).

In 1999, Nemiroff and Wallin founded the online Astrophysics Source Code Li- 
brary (ASCL) to house codes of use to the community, eventually resulting in a library 
of 37 codes, all of which had been described in the literature and used to produce re- 
search published in or submitted to refereed journals. The last code was added in 2002. 
The ASCL site also linked to other code libraries, most of which no longer exist or have 
not been updated in years. 

SkySoft was created in 2001 by C. Baffa, E. Giani, and A. Checcucci. This site is
intended to be community-supported, accepting codes and comments from coders and code
users. It also features recent news on topics of interest to astronomer programmers,
such as notices of upcoming conferences and workshops for this community. The majority
of its code entries date from 2003, with some additional codes from later years.

The Astroforge project was modeled after the wildly successful SourceForge for 
open source software but focused on the needs of astronomers (Remijan, Brunner, 
Tillery, & Haider, 2003) and existed for three years. 
We know of three other independent code information repositories, though
certainly there are many project groups and individuals who pull together such information
for their own work, team, or subspecialty. One such subspecialty repository is
AstrOmatic for astronomical pipeline software; another is the codes wiki for computational
fluid dynamics.

The Astro-Code Wiki, created by AstroSim (the European Network for Computational
Astrophysics), contains 54 codes and has been updated recently. AstroSim was intended
to be a five-year project, running from October 2006 until September 2011, to bring
together European computational astrophysicists; its focus is on comparison of codes for
suitability for specific tasks.

Another repository called Astro-Sim houses about thirty codes, and provides
forums for discussion and links to other tools and libraries. Similarly, AstroShare,
discussed by Shortridge (2009), also houses about thirty codes and allows for discussion
of topics such as releasing software, social media, and middleware.

4. To release or not to release source code 

Previous endeavors, including the first incarnation of the ASCL, had not grown as codes 
have proliferated. Indeed, some scientists are not in the habit of, are reluctant to, or 
openly resist making their codes available to non-collaborators. A look at popular and 
academic literature, our correspondence and conversations with scientist coders, infor- 
mal surveys, and our experiences demonstrate the variety of reasons some scientists 
have for not releasing codes. 

Intellectual property issues: The workplace or granting institutions may place restric- 
tions on sharing code. 

Codes reflect the reality of their creation: Code is often "quick and dirty"; because
it is messy, a coder may be reluctant to release it. Codes also can have a narrow focus,
and the author(s) may not deem them suitable for anything else.

Releasing codes is not standard practice, useful to one's career, necessary, or
desired: it is a fact of life that coding does not get you brownie points.

Releasing codes places demands on the coder, and released codes may be examined
too closely and used inappropriately: programmers may be worried there might
be bugs in their code (Barnes, 2010). This is again a problem of lack of time: checking
code results to an ever stricter, subjective tolerance takes time that could instead be
used to write more papers and so advance in the "publish or perish" dilemma.

4.1. Why codes should be released 

Despite these arguments and absent any national security concerns, we believe it is in- 
cumbent upon scientists to release their codes. If a code does the job it was designed 
to do, it does not matter that the code may be messy, undocumented, or cobbled to- 
gether and inelegant; as a tool used to produce results, the code should be available for 
examination and study, just as any research protocol is. 

We are not alone in this belief. Timmer laments that the reliance on computational
methods in the sciences has scientists giving up on a key component of the scientific
method: reproducibility. A lack of transparency and reproducibility undermines public
confidence in science and slows scientific progress, engendering a credibility
crisis, according to Stodden, who is working to develop the Reproducible Research
Standard.

The NSF has made a recommendation that reproducibility should be promoted,
and states that data and software used in the development of a scientific publication
should be escrowed or archived where they can be examined and re-verified when
needed (Stewart et al. 2010).



5. ASCL 

We have implemented a new way to provide a large set of peer-reviewed described
codes from the community in an easily accessible place. The details are described in our
accompanying paper (Allen et al. 2012). An additional outcome of such a repository
could be a referencing database for astrophysics code, similar to the newly established
one for data that has been added to the ADS, cf. discussions during a BoF session
(Accomazzi et al. 2012).

NOTE ADDED IN PROOF: ASCL codes are now incorporated into ADS. 